DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...90
Hits 1 – 20 of 1.784

1
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
BASE
Show details
2
Didactique de l’oral et interactions : rétrospective méthodologie et expérimentation en contexte algérien
In: ISSN: 0077-2712 ; EISSN: 1952-4250 ; Mélanges CRAPEL ; https://hal.archives-ouvertes.fr/hal-03630135 ; Mélanges CRAPEL, Centre de recherches et d'applications pédagogiques en langues, 2022, Enseignement du français parlé aujourd’hui : Recherches et expériences de terrain, Mélanges CRAPEL (43/1) ; https://www.atilf.fr/wp-content/uploads/publications/MelangesCrapel/Melanges_43_1_6_Cortier_et_al.pdf (2022)
BASE
Show details
3
Language identification, a tool for Corsican and for the evaluation of linguistic resources ; L'identification de langue, un outil au service du corse et de l'évaluation des ressources linguistiques
In: Traitement Automatique des Langues ; https://hal.archives-ouvertes.fr/hal-03633290 ; Traitement Automatique des Langues, 2022, Diversité Linguistique, 62 (3), pp.13-37 ; https://www.atala.org/content/diversité-linguistique-linguistic-diversity-natural-language-processing (2022)
Abstract: International audience ; The constitution of corpora is one of the first priorities faced by less-resourced languages. The emergence of Internet-based resources of increasing size and covering more and more languages may suggest that this issue has been resolved, but this is not the case. Following Caswell et al. (2021), who evaluated several large resources, including one with Corsican content, we conducted an analysis of two corpora including this language: An Crúbadán and W2C. In parallel to a manual evaluation, we considered the possibility of using one or more language identification modules to filter the content of these resources, which turns out to be possible but at the cost of low recall. For this task, we tested and re-trained various systems in order to adapt them to Corsican. This work makes it possible to provide a model allowing the identification of 17 European languages as well as Corsican ; La constitution de corpus est une des premières priorités que rencontrent les langues peu dotées. L’émergence de ressources issues d’Internet, de tailles de plus en plus imposantes et couvrant de nombreuses langues, peut laisser penser que ce point est désormais résolu, ce qui n’est pas le cas. À la suite de Caswell et al. (2021), qui ont évalué plusieurs ressources de grande envergure, dont une disposant de contenu corse, nous avons mené une analyse de deux corpus incluant cette langue : An Crúbadán et W2C. Parallèlement à une évaluation manuelle, nous avons estimé la possibilité d’utiliser un ou plusieurs modules d’identification de langue afin de filtrer le contenu de ces ressources, ce qui s’avère possible mais au prix d’un rappel peu élevé. Pour cette tâche, nous avons testé et réentraîné divers systèmes afin de les adapter au mieux au corse. Ce travail nous permet de mettre à disposition un modèle capable d’identifier le corse ainsi que 17 autres langues européennes.
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; corpora; corpus; corse; Corsican; identification de langue; language identification; langues peu dotées; less-resourced languages; qualité; quality
URL: https://hal.archives-ouvertes.fr/hal-03633290/file/TAL_62_3_1_Kevers_HAL.pdf
https://hal.archives-ouvertes.fr/hal-03633290/document
https://hal.archives-ouvertes.fr/hal-03633290
BASE
Hide details
4
Machine Translation and Gender biases in video game localisation: a corpus-based analysis
In: https://hal.archives-ouvertes.fr/hal-03540605 ; 2022 (2022)
BASE
Show details
5
NewsCom-NEG Corpus ...
BASE
Show details
6
NewsCom-NEG Corpus ...
BASE
Show details
7
Lothian Diaries Dataset 1 (May-September 2020) ...
Hall-Lew, Lauren. - : Edinburgh DataVault, 2022
BASE
Show details
8
Gendered body language in children’s literature over time ...
Čermáková, Anna; Mahlberg, Michaela. - : Apollo - University of Cambridge Repository, 2022
BASE
Show details
9
Measuring Terminology Consistency in Translated Corpora: Implementation of the Herfindahl-Hirshman Index
In: Information; Volume 13; Issue 2; Pages: 43 (2022)
BASE
Show details
10
Frequency, Informativity and Word Length: Insights from Typologically Diverse Corpora
In: Entropy; Volume 24; Issue 2; Pages: 280 (2022)
BASE
Show details
11
Text+: Language- and text-based Research Data Infrastructure ...
BASE
Show details
12
Text+: Language- and text-based Research Data Infrastructure ...
BASE
Show details
13
Text+: Language- and text-based Research Data Infrastructure ...
BASE
Show details
14
DCT vs. corpus orales: reflexiones metodológicas sobre el estudio de los actos de habla ; DCT vs. spoken corpora: methodologic reflections on the study of speech acts
In: Pragmalingüística, (29), 377-395 (2022)
BASE
Show details
15
Source language difficulties in learner translation: Evidence from an error-annotated corpus
Kunilovskaia, Mariia; Ilyushchenya, Tatyana; Morgoun, Natalia. - : John Benjamins Publishing, 2022
BASE
Show details
16
ANLIzing the Adversarial Natural Language Inference Dataset
In: Proceedings of the Society for Computation in Linguistics (2022)
BASE
Show details
17
Chinese Idioms: Stepping Into L2 Student’s Shoes
In: Acta Linguistica Asiatica, Vol 12, Iss 1 (2022) (2022)
BASE
Show details
18
Understanding corpus linguistics
Leibniz-Zentrum Allgemeine Sprachwissenschaft
Show details
19
A Navigation Tool for Exploring Semantic Web Corpora
In: Proceedings of the ISWC 2021 Posters, Demos and Industry Tracks ; https://hal.univ-lorraine.fr/hal-03485155 ; Proceedings of the ISWC 2021 Posters, Demos and Industry Tracks, Oct 2021, Virtual conference, France (2021)
BASE
Show details
20
A Semantic Web Navigation Tool for Exploring the Henri Poincaré Correspondence Corpus
In: Proceedings of the International Joint Workshop on Semantic Web and Ontology Design for Cultural Heritage ; https://hal.univ-lorraine.fr/hal-03406713 ; Proceedings of the International Joint Workshop on Semantic Web and Ontology Design for Cultural Heritage, Antonis Bikakis, Roberta Ferrario, Stéphane Jean, Béatrice Markhoff, Alessandro Mosca, Marianna and Nicolosi Asmundo, Sep 2021, Bolzano, Italy (2021)
BASE
Show details

Page: 1 2 3 4 5...90

Catalogues
6
0
5
0
0
0
1
Bibliographies
24
0
0
0
0
0
0
18
9
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1.729
2
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern